Chapter 1 - About Text Encodings and Conversions
This chapter introduces the Text Encoding Conversion Manager. As a prelude, it explains why text encoding conversion is necessary. Then it describes the Text Encoding Conversion Manager's two main components--the Text Encoding Converter and the Unicode Converter--suggesting why you should choose one over the other for your conversion processes. The remainder of the chapter explores some of the terms and concepts that pervade text encoding and the process of converting from one encoding to another, including
Finally, the chapter highlights the Text Encoding Conversion Manager package contents and gives a terse history of its past releases.
- Characters, codes, coded character sets, and character encoding schemes
- Text representation and text elements
- Text encoding specifications
- Unicode, in the context of its emergence as a solution to text encoding complexities
- Round-trip fidelity, strict and loose mapping, Corporate Use Zone mappings, and fallback mappings
You should read this chapter if you are developing
After reading this chapter, you should read the reference chapters that describe basic text types for specifying text encodings and other aspects of conversion, the Text Encoding Converter, and the Unicode Converter. The reference chapters are meant to be used as you develop your applications. Although this book doesn't include tutorial chapters, you can consult its descriptions of data structures and functions to gain a high-level understanding of how to use the converters.
- Internet-savvy applications, such as web browsers or e-mail applications.
- Applications that transfer text across platforms.
- Applications based in Unicode, such as a word processor or file system that operates in Unicode.
For general information about how the Mac OS handles text, consult Inside Macintosh: Text.
The Text Encoding Conversion Manager software can run on Mac OS System 7.1 or later. The converter libraries and associated files are installed by default as part of Mac OS 8 and as part of Mac OS Runtime for Java.
Chapter Contents
- Why You Need to Convert Text From One Encoding to Another
- Deciding Which Encoding Converter to Use
- The Text Encoding Converter
- The Unicode Converter
- Character Encoding and Other Concepts Fundamental to Text Encoding Conversion
- Characters
- Coded Character Sets
- Presentation Forms
- Character Encoding Schemes
- Text Encoding Specifications
- About Unicode and the Complexities of Conversion
- About Unicode
- ISO/IEC 10646
- Round-Trip Fidelity
- Multiple Semantics and Multiple Representations
- Strict and Loose Mapping
- Fallback Mappings
- Corporate Use Zone
- About the Text Encoding Manager Package
- About Earlier Releases
- Checking the Version
- Unicode Converter 68K Static Libraries